The Classi cation Problem in Relational Property Testing
نویسنده
چکیده
In property testing, we desire to distinguish between objects that have a given property and objects that are far from the property by examining only a small, randomly selected portion of the objects. Property testing arose in the study of formal verication, however much of the recent work has been focused on testing graph properties. In this thesis we introduce a generalization of property testing which we call rela-tional property testing. Because property testers examine only a very small portion of their inputs, there are potential applications to eciently testing properties of massive structures. Relational databases provide perhaps the most obvious example of such massive structures, and our framework is a natural way to characterize this problem. We introduce a number of variations of our generalization and prove the relationships between them. The second major topic of this thesis is the classication problem for testability. Using the general framework developed in previous chapters, we consider the testability of various syntactic fragments of rst-order logic. This problem is inspired by the classical problem for decidability. We compare the current classication for testability with early results in the classication for testability, and then prove an additional class to be testable.
منابع مشابه
Distribution free bounds for relational classi cation
Statistical Relational Learning (SRL) is a sub-area in Machine Learning which addresses the problem of performing statistical inference on data that is correlated and not independently and identically distributed (i.i.d.) { as is generally assumed. For the traditional i.i.d. setting, distribution free bounds exist, such as the Hoe ding bound, which are used to provide con dence bounds on the ge...
متن کاملAlgorithms and Applications for Universal Quanti cation in Relational
Queries containing universal quanti cation are used in many applications, including business intelligence applications and in particular data mining. We present a comprehensive survey of the structure and performance of algorithms for universal quanti cation. We introduce a framework that results in a complete classi cation of input data for universal quanti cation. Then we go on to identify th...
متن کاملModerated Class membership Interchange in Iterative Multi relational Graph Classi er
Organizing information resources into classes helps signi cantly in searching in massive volumes of on line documents available through the Web or other information sources such as electronic mail, digital libraries, corporate databases. Existing classi cation methods are often based only on own content of document, i.e. its attributes. Considering relations in the web document space brings bet...
متن کاملThe Reengineering of Relational Databases Based on Key and Data Correlations
This paper addresses the problem of recovering object-oriented schemata from relational databases. Solutions to this problem are particularly useful for designing wrappers for federated database systems. Our goal here is to describe a reverse engineering methodology for the DOK federated database system (Tari et al. 1996), enabling the wrappers to express relational schemata as object-oriented ...
متن کاملData Mining using Nonmonotonic Connectionist Expert Systems
An application of Nonmonotonic Connectionist Expert Systems (NCESs) in mining classi cation rules from large relational databases is presented. NCESs are hybrid learning systems that can acquire symbolic knowledge of a nonmonotonic domain, represented using nonmonotonic inheritance networks. This initial knowledge can be re ned using connectionist learning techniques and a set of classi ed exam...
متن کامل